Datasets and software

June 7, 2026, 11:19 a.m.

FLEURS-Badini

June 1, 2026, 10:25 a.m.

FLEURS-Badini* is the first benchmark for machine translation, speech translation, and speech recognition in the Badini dialect. It extends the FLEURS dataset by providing Badini translations and spe…

Details

Common Voice Badini

June 1, 2026, 10:20 a.m.

Common Voice Badini* is a transliterated version of the Northern Kurdish portion of the Common Voice dataset, revised to conform to standard Badini orthography. This resource aims to facilitate the r…

Details

Badini ASR Benchmark

June 1, 2026, 10:13 a.m.

The *Badini ASR Benchmark* is the first multi-domain speech recognition benchmark designed and recorded for the Badini variant of the Kurdish language. The dataset has been manually validated, and al…

Details